AITopics

Country:

North America > United States (0.46)
North America > Canada > Quebec (0.28)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Sachan, Swati, Fickett, Dale S.

DeFi TrustBoost: Blockchain and AI for Trustworthy Decentralized Financial Decisions

arXiv.org Artificial IntelligenceDec-2-2025

This research introduces the Decentralized Finance (DeFi) TrustBoost Framework, which combines blockchain technology and Explainable AI to address challenges faced by lenders underwriting small business loan applications from low-wealth households. The framework is designed with a strong emphasis on fulfilling four crucial requirements of blockchain and AI systems: confidentiality, compliance with data protection laws, resistance to adversarial attacks, and compliance with regulatory audits. It presents a technique for tamper-proof auditing of automated AI decisions and a strategy for on-chain (inside-blockchain) and off-chain data storage to facilitate collaboration within and across financial organizations.

artificial intelligence, explanation, machine learning, (16 more...)

2512.00142

Country:

Europe (0.46)
North America > United States (0.28)

Genre: Research Report (0.40)

Industry:

Information Technology > Security & Privacy (1.00)
Banking & Finance > Trading (1.00)
Banking & Finance > Loans (1.00)

Technology:

Information Technology > e-Commerce > Financial Technology (1.00)
Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)

Neural Information Processing SystemsNov-21-2025, 12:18:43 GMT

Clustering Billions of Reads for DNA Data Storage

Cyrus Rashtchian, Konstantin Makarychev, Miklos Racz, Siena Ang, Djordje Jevdjic, Sergey Yekhanin, Luis Ceze, Karin Strauss

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, natural language, (17 more...)

Country:

Asia > Afghanistan > Parwan Province > Charikar (0.04)
South America > Peru > Cusco Department > Cusco Province > Cusco (0.04)
North America > United States > New York (0.04)
(3 more...)

Genre: Workflow (0.46)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Søren Dahlgaard, Mathias Knudsen, Mikkel Thorup

Practical Hash Functions for Similarity Estimation and Dimensionality Reduction

Neural Information Processing SystemsNov-21-2025, 09:59:06 GMT

Neural Information Processing Systems http://nips.cc/

data mining, hash function, machine learning, (18 more...)

Country:

Europe > Denmark > Capital Region > Copenhagen (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > Canada > Quebec > Capitale-Nationale Region > Québec (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Dimensionality Reduction (0.41)

Neural Information Processing SystemsAug-15-2025, 06:49:18 GMT

623307df18da128262aaf394cdcfb235-Paper-Conference.pdf

algorithm, cardinality, sketch, (15 more...)

Country: North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.46)

Industry: Information Technology > Security & Privacy (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.46)

Karjalainen, Maiju, Miettinen, Pauli

Hashing for Fast Pattern Set Selection

arXiv.org Artificial IntelligenceJul-14-2025

Pattern set mining, which is the task of finding a good set of patterns instead of all patterns, is a fundamental problem in data mining. Many different definitions of what constitutes a good set have been proposed in recent years. In this paper, we consider the reconstruction error as a proxy measure for the goodness of the set, and concentrate on the adjacent problem of how to find a good set efficiently. We propose a method based on bottom-k hashing for efficiently selecting the set and extend the method for the common case where the patterns might only appear in approximate form in the data. Our approach has applications in tiling databases, Boolean matrix factorization, and redescription mining, among others. We show that our hashing-based approach is significantly faster than the standard greedy algorithm while obtaining almost equally good results in both synthetic and real-world data sets.

artificial intelligence, machine learning, pattern recognition, (17 more...)

2507.08745

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.35)

Busa-Fekete, Robert, Syed, Umar

Near-optimal algorithms for private estimation and sequential testing of collision probability

arXiv.org Machine LearningApr-18-2025

We present new algorithms for estimating and testing \emph{collision probability}, a fundamental measure of the spread of a discrete distribution that is widely used in many scientific fields. We describe an algorithm that satisfies $(\alpha, \beta)$-local differential privacy and estimates collision probability with error at most $\epsilon$ using $\tilde{O}\left(\frac{\log(1/\beta)}{\alpha^2 \epsilon^2}\right)$ samples for $\alpha \le 1$, which improves over previous work by a factor of $\frac{1}{\alpha^2}$. We also present a sequential testing algorithm for collision probability, which can distinguish between collision probability values that are separated by $\epsilon$ using $\tilde{O}(\frac{1}{\epsilon^2})$ samples, even when $\epsilon$ is unknown. Our algorithms have nearly the optimal sample complexity, and in experiments we show that they require significantly fewer samples than previous methods.

artificial intelligence, data mining, machine learning, (17 more...)

arXiv.org Machine Learning

2504.13804

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Security & Privacy (0.67)
Information Technology > Data Science > Data Mining > Big Data (0.46)

Kang, Junhyeok, Shin, Yooju, Lee, Jae-Gil

VarDrop: Enhancing Training Efficiency by Reducing Variate Redundancy in Periodic Time Series Forecasting

arXiv.org Artificial IntelligenceFeb-3-2025

Variate tokenization, which independently embeds each variate as separate tokens, has achieved remarkable improvements in multivariate time series forecasting. However, employing self-attention with variate tokens incurs a quadratic computational cost with respect to the number of variates, thus limiting its training efficiency for large-scale applications. To address this issue, we propose VarDrop, a simple yet efficient strategy that reduces the token usage by omitting redundant variate tokens during training. VarDrop adaptively excludes redundant tokens within a given batch, thereby reducing the number of tokens used for dot-product attention while preserving essential information. Specifically, we introduce k-dominant frequency hashing (k-DFH), which utilizes the ranked dominant frequencies in the frequency domain as a hash value to efficiently group variate tokens exhibiting similar periodic behaviors. Then, only representative tokens in each group are sampled through stratified sampling. By performing sparse attention with these selected tokens, the computational cost of scaled dot-product attention is significantly alleviated. Experiments conducted on public benchmark datasets demonstrate that VarDrop outperforms existing efficient baselines.

artificial intelligence, data mining, machine learning, (19 more...)

2501.14183

Country: North America > United States > Alabama (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Energy (0.71)

Technology:

Information Technology > Data Science > Data Mining (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Liu, Wenbo, Chen, Handi, Ngai, Edith C. H.

BF-Meta: Secure Blockchain-enhanced Privacy-preserving Federated Learning for Metaverse

arXiv.org Artificial IntelligenceOct-28-2024

The metaverse, emerging as a revolutionary platform for social and economic activities, provides various virtual services while posing security and privacy challenges. Wearable devices serve as bridges between the real world and the metaverse. To provide intelligent services without revealing users' privacy in the metaverse, leveraging federated learning (FL) to train models on local wearable devices is a promising solution. However, centralized model aggregation in traditional FL may suffer from external attacks, resulting in a single point of failure. Furthermore, the absence of incentive mechanisms may weaken users' participation during FL training, leading to degraded performance of the trained model and reduced quality of intelligent services. In this paper, we propose BF-Meta, a secure blockchain-empowered FL framework with decentralized model aggregation, to mitigate the negative influence of malicious users and provide secure virtual services in the metaverse. In addition, we design an incentive mechanism to give feedback to users based on their behaviors. Experiments conducted on five datasets demonstrate the effectiveness and applicability of BF-Meta.

artificial intelligence, bf-meta, machine learning, (18 more...)